Systems and methods for intelligent call transcription

ABSTRACT

Intelligent call transcript systems and methods are disclosed. An intelligent call transcript system can create a transcript of a telephonic communication and supplement the transcript with additional information automatically or upon request. Additional information may be added when key words are detected, such as adding acronym expansion when an acronym is detected or adding identifying information to an important task when mention of the task is detected. Portions of the transcript may also be sent as messages, text-based or audio, upon detection of key words or at the instruction of a user or device.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of, and claims priority to, U.S.patent application Ser. No. 12/340,297, filed Dec. 19, 2008, entitled“Systems and Methods for Intelligent Call Transcription” the entirecontents of which are hereby incorporated herein by reference.

BACKGROUND

During telephone conversations, including conference calls, participantsin the conversation may often find it useful to have a transcript of theconversation. Other interested parties who were not participants in thecall may also find it useful to have a transcript. While transcriptiontechnology has advanced in recent years, it can be difficult to get thecomplete meaning of words spoken without hearing the speaker's voice,including characteristics such as the tone, inflection, emphasis, andother qualities of the spoken words.

Often during a telephonic communication, such as a conference call,items will be discussed that need to be addressed or remembered at alater time. It can be difficult to identify these items at a later time,even with a transcript of the communication, especially if theconversation was over an extended period of time, resulting in a lengthytranscript. This is even more of a problem if the person reviewing thetranscript was not participating in the telephonic communication. Inthese and other situations there exists a need to improve tofunctionality and usefulness of transcription systems.

SUMMARY

Systems and methods are provided herein for intelligent call transcriptsystems and methods. A telephonic or other audible communication may betranscribed into text by using the systems and methods described. Eventsmay be detected that trigger actions taken by the system. Events mayinclude the detection of key words, phrases, or other sounds. Events mayalso include the receipt of commands or instructions by the system. Theaction taken in response to the detections of an event may includeadding supplemental information to the transcript. Such information mayinclude translations, expansion of acronyms, links to other information,links to audio recordings of the communication, tags, indicators ofimportant portions of the transcript, pictures, emoticons, video, or anyother information that may be of interest.

Other actions taken may include sending a message with a portion of thetranscribed text to a user or device. Alternatively, a voice message maybe generated and transmitted containing an audio portion of thetelephonic communication.

These and other features and advantages of various exemplary embodimentsof systems and methods according to this disclosure are described in, orare apparent from, the following detailed description of variousexemplary embodiments of the systems and methods of this subject matter.

BRIEF DESCRIPTION OF THE DRAWINGS

Various exemplary embodiments of this invention will be described indetail, with reference to the following figures, wherein like numeralsrepresent like elements, and wherein:

FIG. 1 is a block diagram of a non-limiting, exemplary intelligent calltranscript system including communications devices, a network, andusers.

FIG. 2A is an illustration of a non-limiting, exemplary embodiment of anintelligent call transcript.

FIG. 2B is an illustration of another non-limiting, exemplary embodimentof an intelligent call transcript sent as a message.

FIG. 3 is a flow chart of non-limiting, exemplary method of implementingan intelligent call transcript system.

FIG. 4 is a block diagram of a non-limiting, exemplary wireless devicethat may be used in connection with an embodiment.

FIG. 5 is a block diagram of a non-limiting, exemplary processor inwhich the present subject matter may be implemented.

FIG. 6 is an overall block diagram of an exemplary packet-based mobilecellular network environment, such as a GPRS network, in which thepresent subject matter may be implemented.

FIG. 7 illustrates a non-limiting, exemplary architecture of a typicalGPRS network as segmented into four groups.

FIG. 8 illustrates a non-limiting alternate block diagram of anexemplary GSM/GPRS/IP multimedia network architecture in which thepresent subject matter may be implemented.

DETAILED DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS

The subject matter of the disclosed embodiments is described withspecificity to meet statutory requirements. However, the descriptionitself is not intended to limit the scope of this patent. Rather, theinventors have contemplated that the claimed subject matter might alsobe embodied in other ways, to include different steps or elementssimilar to the ones described in this document, in conjunction withother present or future technologies. Moreover, although the term “step”may be used herein to connote different aspects of methods employed, theterm should not be interpreted as implying any particular order among orbetween various steps herein disclosed unless and except when the orderof individual steps is explicitly described.

In the discussion that follows, details relating to telephony devicesand networks are assumed to be known to those skilled in the art.Accordingly, such details are largely omitted herein for the sake ofclarity and explanation. In addition, any references herein to anexample embodiment involving a cellular, wireless, or mobile telephoneare used solely for purposes of explanation, and are not intended tolimit the invention to any such embodiment. For example, a wirelessdevice as contemplated by various embodiments of the invention mayinclude, but is not limited to: cellular telephones, personal digitalassistants (PDAs), email devices, portable and non-portable computers,and the like. The wireless device may operate in a cellular, SMR, PCS,cordless, unlicensed AWS, 700 MHz, or other spectrums. Furthermore,embodiments are not limited by the network servicing the device.Accordingly, embodiments may be applicable to any network typeincluding, for example, TDMA, CDMA, WCDMA, GSM, WiFi, WiMAX, OFDM, UMTS,EV-DO, HSDPA/HSUPA, and other standards now known or to be developed inthe future.

FIG. 1 illustrates a system 100 including communications devices, anetwork, and a network device that may be in use during a telephoniccommunication. User 101 may be participating in a telephoniccommunication with users 102 and 103. This may be a three-way call, aconference call, or any other form of voice communications. It iscontemplated that any number of users, including one user, may beparticipating in a telephonic communication with any other users and/ordevices. For example, a user may be communicating with an automatedsystem rather than one or more other users.

User 101 may be operating mobile communications device 111 as a means toparticipate in the telephonic communication. Mobile communicationsdevice 111 may be any type of mobile communications device as describedherein, or any device capable of wirelessly facilitating telephoniccommunications. For example, mobile communications device 111 may be amobile, wireless, and/or cellular telephone of any type, a personal dataassistant (PDA), a laptop or other mobile computer, or any other mobiletelephony-capable device.

Similarly, user 103 may be using mobile communications device 113 toparticipate in the telephonic communication. Mobile communicationsdevice 113 may also be any type of mobile communications device asdescribed herein, or any device capable of wirelessly facilitatingtelephonic communications. For example, mobile communications device 113may be a cellular telephone, a mobile telephone of any type, a personaldata assistant (PDA), a laptop or other mobile computer, or any othermobile telephony-capable device.

User 102 may be using wired communications device 112 to participate inthe telephonic communication. Wired communications device 112 may be anytype of wired communications device capable of facilitating telephoniccommunications, including a traditional telephone, an internet protocol(IP) telephone, a digital telephone, a computer of any type configuredto facilitate telephonic communications, or any other wiredtelephony-capable device.

Users 101, 102, and 103, through the operation of devices 111, 112, and113, may be communicating through network 110. Network 110 may be one ormore wired and/or wireless network of any type, non-limiting examples ofwhich are described herein, or any combinations thereof. Any form oftelecommunications, voice, or data network, or any combination of suchnetworks, may be present in network 110.

Network device 104 may be communicatively connected to network 110, maybe a subsystem of network 110, and may be operated, owned, and/ormaintained by the operator of network 110 or by a third party. Networkdevice 104 may or may not be geographically located in the same room,building, region, or area as other devices that make up network 110.Network device 104 may facilitate the telephonic communication betweenuser 101, 102, and 103. Network device 104 may provide telephonyservices as known to those skilled in the art. For example, networkdevice 104 may function as a private branch exchange (PBX). In oneembodiment, network device 104 provides intelligent call transcriptionservices as described herein.

Network device 104 may be configured to transcribe the telephoniccommunication between users 101, 102, and 103. Network device 104 mayuse any transcription method or means that can convert the spoken wordsor other sounds detected in the telephonic communication into text. Anyautomated or computerized speech recognition system, means, or methodsmay be employed to transcribe the telephonic communication. Networkdevice 104 may also be configured to perform other network tasks, or maybe configured only for intelligent transcription of telephoniccommunications. Transcription may be performed using any means known tothose skilled in the art, and may be performed by default, for examplewhen used in conjunction with a conference call system, or may beperform at the instruction of a user or device. The transcribedtelephonic communication may be provided in any form of transcript,including electronic documents and other electronic forms of textpresentation. Such electronic transcripts may include software and datathat allow a user, device, or software to manipulate, alter, and/orperform further processing on the transcript. Electronic versions of thetranscript may also integrate software and/or data such that thetranscript itself is an executable software application.

In addition to transcribing the telephonic communication, network device104 may record and/or store the communication on one or more computerstorage devices, such as a hard disk drive or random access memory,and/or on magnetic or digital tape, disks, or other media, or any othereffective recording or storage means. If the telephonic communication isstored in some manner, transcription may be performed before, during, orafter the telephonic communication is stored.

Network device 104 may also be configured to accept instructions and/orcommands from users and/or other network devices directly and/or throughnetwork 110. Such instructions may be received through one or moreinterfaces, such as a web page, an application, or an applicationprogramming interface. These interfaces may be used in conjunction withother devices and/or software that allow a user and/or another device toissue instructions to network device 104. Instructions may also betransmitted to network device 104 through the use of tone generated by atelephone, such as dual-tone multi-frequency (DTMF) tones. Instructionsmay also be detected by network device 104 through voice or speechrecognition. Instructions may also be detected or received by one ormore other devices that, responsive to receiving or detecting theinstruction, transmit one or more instructions to network device 104. Insome embodiments, network device may only accept or recognizeinstructions from approved or designated users or devices. For example,a designated chairperson in a conference call may be the only userpermitted to instruct network device 104 to begin transcribing atelephonic communication. All other means and methods of issuing andreceiving instructions on a device and all other methods and means ofrestricting the ability of users and devices to issue instructions to adevice are contemplated as within the scope of the present disclosure.

In one embodiment, network device 104 transcribes the entire telephoniccommunication. In an alternative embodiment, network device 104 maytranscribe only a portion of the telephonic communication. Networkdevice 104 may receive a command or instruction from a user or deviceinstructing network device 104 when to begin and/or when to stoptranscribing a telephonic communication. For example, user 101 mayoperate mobile communications device 111 by pressing a key on a key padof mobile communications device 111, sending a tone to network 110.Network 110 may transmit this tone to network device 104, which may beconfigured to recognize such tones and perform actions in response tosuch tones. Alternatively, network 110 and/or one or more devices withinnetwork 110 may be configured to detect such tones and, in response todetecting the tone, transmit an instruction or command to network device104.

In another alternative, network device 104 may be configured to detectcertain voice commands or speech that network device 104 may interpretas a command or instructions. For example, user 101 may say “begintranscription” during the telephonic communication, which network device104 may recognize as a command and begin transcription. Alternatively,network 110 and/or one or more devices within network 110 may beconfigured to detect such voice commands and, in response to detectingthe voice command, transmit an instruction or command to network device104.

In one embodiment, a command may be received or detected that instructsnetwork device 104 to stream the transcript as it is created to one ormore designated or default devices or users. Such a feature may behelpful to those users who are in an area where listening to aconversation is difficult, such as an area with loud noise present orwhere absolute silence is requires. In such an embodiment, thetranscript may be streamed as it is created, or the transcript may bestreamed at a point in the future for later reference. Other embodimentsof streaming transcripts are contemplated.

In still another alternative, network device 104 may receiveinstructions to begin and/or end transcription through an interface. Forexample, user 103 may operate a web browser on mobile communicationsdevice 113, or on a separate device, which displays a web interface tonetwork device 104, or to another device that is communicativelyconnected to network device 104. This interface may allow user 103 toinstruct network device 104 to begin transcribing the telephoniccommunication. In another embodiment, network device 104 may receive aninstruction to enter a mode in which instructions are received regardingtranscription commands. For example, a user may send a tone command tonetwork device 104 by pressing the ‘*’ key. Network device 104 mayrecognize this as an instruction to enter transcription command mode.Thereafter, network device 104 may begin recognizing command orinstructions, such as voice commands, that affect call transcription.All other means or methods of instructing network device 104 to beginand/or end transcription of a telephonic communication, and all othermeans and methods of instructing network device 104 when and if toaccept transcription commands, are contemplated as within the scope ofthe present disclosure.

In one embodiment, network device 104 may record, or store, thetelephonic communication during, before, or after the transcription. Inanother embodiment, network device 104 may maintain a temporaryrecording of the telephonic communication as it is ongoing, in a bufferor similar storage means, and move the temporarily stored telephoniccommunication to permanent storage upon instruction received from auser, administrator, or device. In some embodiments, network device 104may maintain the entire telephonic communication in a buffer. In otherembodiments, network device 104 may maintain only a preconfigured amountof the telephonic communication in a buffer, for example the most recentten minutes, allowing a user to direct network device 104 to move thebuffered amount of the telephonic communication to permanent storage asdesired. Other means and methods of selecting or directing portions orthe entirety of a telephonic communication to be stored permanently arecontemplated as within the scope of the present disclosure.

Network device 104 may correlate the transcribed text with the recordingtime of the recorded telephonic communication and insert one or moretags into the transcript based on the recording time. Such tags may belinks, hyperlinks, data, metadata, or any other indicator or softwareconstruct that connects or references the audio recording of thetelephonic communication. For example, as described in more detailbelow, network device 104 may include in the transcript a tag associatedwith each, or selected, words in the transcript. Alternatively, networkdevice 104 may label sentences, words, or portions of the transcript,and associate a tag with each label. For example, network device 104 mayidentify each speaker in the telephonic communication and add thespeaker's name or other identifier to the transcript. Network device 104may create and associate a tag with each name or identifier. Other itemswithin a transcript may have tags associated with them, and all suchitems are contemplated as within the scope of the present disclosure.

The tag associated with a word, name, identifier, etc., may serve as aconnector to an audio recording of the telephonic communication. Theaudio recording may be a separate file apart from the transcript, or itmay be embedded, integrated, or otherwise included within thetranscript. When such a tag is activated, it may initiate the playing ofthe audio recording. Such playing may include the launching of anapplication to play audio files. Activation of the tag may instruct theaudio recording, or a related application, to start playing therecording at the time that the word with which the tag is associated wasspoken in the telephonic communication. Alternatively, the recording orits associated application may be instructed to begin playing therecording at a predetermined amount of time before or after the timethat the word with which the tag is associated was spoken in thetelephonic communication. For example, if a user of the transcriptclicks on a tag associated with the word “budget”, the audio recordingor its associated application may be instructed to play the audiorecording starting ten seconds before that instance of the word “budget”was said in the telephonic communication. The timing of when to startplaying the audio recording of a telephonic communication in relation towhen the selected word was said may be configured by a user or anotherdevice, and may be adjustable at any point during the telephoniccommunication or during the use of the transcript.

Alternatively, the recording or its associated application may beinstructed to begin playing the recording at a predetermined pointbefore or after the time that the word with which the tag is associatedwas spoken in the telephonic communication. For example, if a user ofthe transcript clicks on a tag associated with the word “budget”, theaudio recording or its associated application may be instructed to playthe audio recording starting at the sentence in which that instance ofthe word “budget” was said in the telephonic communication. In yetanother alternative, the audio recording or its associated applicationmay be instructed to play the audio recording at the point when thespeaker who said that instance of the word “budget” most recentlystarted speaking in the telephonic communication. For example, if a userof the transcript clicks on a tag associated with the word “budget”, theaudio recording or its associated application may be instructed to playthe audio recording starting at the point where the speaker of thatinstance of the word “budget” began talking before the speaker spokethat particular instance of the word “budget” in the telephoniccommunication.

In some embodiments, network device 104 may detect specific words orcommands and take actions based on the detection. In one embodiment, acommand may be sent to network device 104 during the telephoniccommunication that instructs network device 104 to insert an indicatorinto the transcript. User 101 may send a tone to network device 104 (orto an intermediate device that translates the tone into a command) thatinstructs network device 104 to mark a portion of the text in a certainmanner. Network device 104 may be instructed to add highlighting to aportion of the transcript. Alternatively, network device 104 may beinstructed to add text, markings, or other indicators to the transcript.For example, user 101 may be given an action item during a conferencecall, and may press “*1” to generate an instruction to network device104 to mark the action item in the transcript. Network device 104 maythen mark a portion of the transcript, such as highlighting the textassociated with the last 30 seconds of the telephonic communicationbefore receiving the instruction. Alternatively, network device 104 mayinsert the words “ACTION ITEM” into the transcript in a manner thatmakes locating the action item within the transcript easier.

In another embodiment, network device 104 may detect certain words orsounds within the telephonic communication and, responsive to detectingthe certain words or sounds, insert an indicator into the transcript.For example, network device 104 may detect the words “action item” whensaid by a participant in the telephonic communication and mark the textassociated with a period of time before and/or after the detection ofthe words “action item”. In yet another alternative, network device 104may be instructed through an interface, such as a software applicationor a web page, to mark certain portions of the transcribed telephoniccommunication. All other means and methods of instructing network device104 to mark a portion of a transcript, and any markings, indicia,highlighting or any other means or methods of identifying a portion of atranscript may be used. All such means and methods are contemplated aswithin the scope of the present disclosure.

In other embodiments, actions may be taken to communicate a portion orthe entirety of the transcript to one or more users or devices. In oneembodiment, network device 104 may detect specific words or commands andcreate a text message or email message and transmit the message to auser. Network device 104 may include in the message a portion of thetranscribed telephonic communication. For example, after detecting avoice, tone, or other form of command, or after receiving a commandthrough an interface, network device 104 may include in a message thetranscribed portions of the telephonic communication that correspond totime periods of the telephonic communication before, during, and/orafter the command was detected or received. The time periods may bepredetermined and configured by an administrator, user or device.Alternatively, a specification of the time period to be captured in thetranscript portion included in the message may be specified by the useror device issuing the command.

In another embodiment, actions may be taken to communicate an audioportion or the entirety of the recorded telephonic communication to oneor more users or devices as a voice message. Such a voice message may besent to a participant in the telephonic communication, or to anotherparty who may be interested in the substance of the communication. Sucha voice message may be sent to a telephony device, such as a phone orvoicemail box, or may be attached to or otherwise included with amessage such as an email or text message. In one embodiment, networkdevice 104 may detect specific words or commands and create a clip of arecording of the telephonic transcript and transmit the clip to a user.Network device 104 may include with the clip a description of telephoniccommunication, such as a time, date, user names, or other identifyinginformation. For example, after detecting a voice, tone, or other formof command, or after receiving a command through an interface, networkdevice 104 may transmit one more portions of the recorded telephoniccommunication that correspond to time periods of the telephoniccommunication before, during, and/or after the command was detected orreceived. The time periods may be predetermined and configured by anadministrator, user or device. Alternatively, a specification of thetime period of the recorded telephonic communication to be captured maybe specified by the user or device issuing the command. Other means andmethods of determining the portion of the telephonic communication to besent in a voice message are contemplated as within the scope of thepresent disclosure.

In another embodiment, actions may be taken to add information to thetranscript. In one alternative, words may be detected by network device104 using voice or speech recognition technology and additionalinformation may be added based on the detection of the words. In anotherembodiment, a command may be received or detected by network device 104instructing network device 104 collect and/or provide additionalinformation in the transcript about the word or words detected. Forexample, an acronym may be detected, such as “IP”, and network device104 may integrate the meaning of the acronym into the transcript, suchas adding “[internet protocol]” following the transcription of “IP”.Alternatively, after a user hears the acronym “IP” during the telephoniccommunication, the user may issue a command through any of the meansdescribed herein instructing network device 104 to provide the meaningof the acronym “IP” in the transcript.

Alternatively, network device 104 may add other information or links toinformation about detected words. For example, if network device 104detects the word “BigCo”, or is instructed to provide information aboutthe word “BigCo” after it is detected in the telephonic communication,network device 104 may insert into the transcript, in one embodiment ator near the word “BigCo”, a hyperlink to BigCo's website, a hyperlink torecent news stories about BigCo, a listing of BigCo's most recent stockprice, a report generated by a participant about BigCo, and/or any otherinformation that may be available. Any type of information or connectionor link to such information may be provided.

In another embodiment, network device 104 may detect foreign languagewords and automatically provide a translation for such words.Alternatively, network device 104 may be instructed to provide atranslation of such words by a user using any of the command methods andmeans described herein. For example, network device 104 may detect thewords “merci beaucoup”, or may be instructed to provide a translation ofthe words ““merci beaucoup” after it is detected in the telephoniccommunication. Network device 104 may insert into the transcript, in oneembodiment at or near the words “merci beaucoup”, “[thank you verymuch]”.

In yet another embodiment, the tone or emotional content of the wordsspoken may be detected and actions taken based on the detection. Forexample, if a speaker is particularly happy-sounding, which may bedetermined using voice and/or speech recognition technology, networkdevice 104 may insert into the transcript, in one embodiment at or nearthe happy-sounding words, an emoticon or other indicator of the happyemotion associated with the words. This additional information may behelpful to a reader of the transcript who was not present during thetelephonic communication to get the full meaning of the words spoken.

In still another embodiment, a picture or video of the speaker may beincluded in the transcript. Speakers may be identified in any number ofways, including by a log-in used to enter into the telephoniccommunication, voice detection and identification, or any othereffective means. Network device 104 may, after identifying the speaker,insert a picture of the speaker obtained from any available source.Alternatively, as is more common, the telephonic communication may alsobe a video communication, with video available of one or more of theparticipants. Video clips of the speakers as they spoke the words in thetranscript may be included with the transcript, in one embodiment at ornear the words spoken. Alternatively, video of other participants in thetelephonic communication may be included with the words spoken by aparticipant, in one embodiment at or near the words spoken, so that auser of transcript can see the reactions of the other participants asthe words are spoken. The types of information and/or the method ofpresenting the information may be configured by a user, administrator,or device, and/or may be specified in a command instructing networkdevice 104 to provide the information. All such means and methods arecontemplated as within the scope of the present disclosure.

FIG. 2A illustrates a non-limiting, exemplary transcript 200, displayedon display 210. Transcript 200 may be a software or data version of atranscript. Transcript 200 may be generated using any of the meansdescribed herein, or any other effective means. Display 210 may be acomputer monitor, television, PDA display, mobile device display, or anyother type of display capable of displaying text.

Transcript 200 includes lines of text 220 that represent a transcribedportion of a telephonic communication. Transcript 200 may set out eachspeaker's words separately. Each speaker may be identified by a label,such as label 255 that identifies Speaker 1. The labels identifyingspeakers may also contain tags or links to audio portions of thetelephonic communication as described above. Each word, or selectedwords, within the transcript may also contain links or tags to audioportions of the telephonic communication as described above.

Words within the telephonic communication may be detected and actionstaken in response. Alternatively, commands may be issued to anintelligent call transcript system instructing the system to takeactions based on detected words. For example, key words 240, in thisexample comprising “action item”, may be detected by an intelligent calltranscript system. The intelligent call transcript system may,responsive to this detection, highlight lines 230 that are proximate tokey words 240. Key words 240 may themselves be highlighted or otherwiseidentified. In addition to, or instead of, the highlighting, theintelligent call transcript system may add a tag, identifier, or otherindicator, such as identifier 250, to the transcript to identify the keywords and/or the words or content proximate to the key words. Any typeof identifier, tag, or other means of making a portion of transcript 200stand out are contemplated as within the scope of the presentdisclosure.

In one embodiment, an abridged version of a transcript may be generatedwith only selected text versions of portions of the telephoniccommunication present, along with, in one embodiment, tags or otheradditional information. For example, transcript 200 may only contain thetext associated with the portions of the telephonic communication thatare associated with a key word or words, such as “action item”. Othermeans and methods of producing an abridged transcript are contemplatedas within the scope of the present disclosure.

In another embodiment, other information may be included based ondetected words, or at the instruction of a user or device. Key word 251,in this example comprising the word “BigCo”, may be associated with alink that, when activated, connects the user to a web site, news source,or other construct that provides more information about key word 251 orother subjects. Alternatively, when key words 240 are detected, a linkmay be created that, when activated, generates a message, such as anemail or text message, transmitting the portion of the transcript, suchas lines 230, to a user or device. The detection of key words, such askey word 251, may also instruct intelligent call transcript system toinsert additional information into the transcript, such as additionalinformation 253, in this example comprising the stock price of BigCostock. Any other useful information, such as a translation of a detectedforeign word, may be provided.

In another embodiment, acronyms may be expanded to provide their fullmeanings within the transcript. For example, acronym 257, in thisexample comprising the acronym “DRM”, may be detected and additionalinformation 259, in this example comprising “Digital Rights Management”,may be added to the transcript, in one embodiment at or proximate todetected acronym 257 or in another embodiment, compiled with otheracronyms to form a glossary of acronyms. Any other methods and means ofdetecting words or receiving instructions to take actions based ondetected or transcribed words may be used, as may any other means ormethods of taking actions. All such means and methods are contemplatedas within the scope of the present disclosure.

In some embodiments, an intelligent call transcript system may transmitportions of a transcript to users upon instruction by a user or device,or upon detection of one or more words or sounds. For example, when keywords 240 are detected, the intelligent call transcript system maygenerate message 280 shown in FIG. 2B. User 260 may receive the messageon mobile communications device 270, and may view the message on display271 of mobile communications device 270. Message 280 may be an email, atext message, or any other form of text content. Message 280 may alsoinclude the name of the recipient, an identifier of the sender and/orrequestor of the message, a subject line, and/or any other informationthat may be available. Message 280 may also include additionalinformation provided by an intelligent call transcript system, such aslinked words 285. Linked words 285 may include a link that, whenactivated by user 260, plays an audio clip of the portion of thetelephonic communication captured in message 280, plays video of one ormore of the speakers, connects user 260 to a web page or other softwareconstruct that may contain additional information, or provide any otherinformation or connections. All such embodiments are contemplated aswithin the scope of the present disclosure.

FIG. 3 illustrates a non-limiting, exemplary method 300 of implementingan intelligent call transcript system. At block 310, transcription of atelephonic communication is started. While the transcription isprogressing, at block 320, the intelligent call transcript system maydetect events that require or instruct the intelligent call transcriptsystem to take actions. For example, the intelligent call transcriptsystem may detect a key word or words that indicate certain steps are tobe taken, such as inserted the meaning of an acronym, the translation ofa key word, an emoticon, a link or tag, or other additional informationor data into the transcript. Alternatively, the intelligent calltranscript system may detect or receive a command or instruction from auser, administrator, or other device that instructs the intelligent calltranscript system to take an action. Any type of event or instructionthat can be detected or received by an intelligent call transcriptsystem is contemplated as within the scope of the present disclosure.

If, at block 320, no event is detected, at block 340 it is determinedwhether the telephonic communication is ongoing. If so, transcription iscontinued at block 350. The determination made at block 340 may alsoinclude determining if an instruction, command, or event has beendetected or received that instructs the intelligent call transcriptsystem to stop transcribing the telephonic communication. If an eventindicating that the intelligent call transcript system should stoptranscribing is detected, or if it is determined at block 340 that thetelephonic communication is not ongoing, then the method is exited.

Returning to block 320, if an event is detected that indicates to theintelligent call transcript system that an action should be taken, theaction is taken, or initiated, at block 330. The action may be any typeof action as described herein, such as translating a word or words,expanding on an acronym, adding identifiers, links, tags, highlighting,pictures, emoticon, and/or video to the transcript, transmitting anaudio, video, or text portion of the transcript to a user, terminatingthe telephonic communication, and/or any other type of action. All suchactions are contemplated as within the scope of the present disclosure.

After initiating the action at block 330, at block 340 the determinationis made as to whether the telephonic communication is ongoing. If so, atblock 350 the transcription continues. If the telephonic communicationhas been terminated, the method is exited.

In one embodiment, the intelligent call transcript systems, methods, andfeatures described above may be implemented on a user device, such asmobile communications device 111, 113, or 270, or wired communicationsdevice 112, rather than, or in addition to, a network device such asnetwork device 104. Alternatively, network device 104 may be a componentor subsystem of a communications device, such as mobile communicationsdevices 111, 113, or 270, or wired communications device 112. Thus, thefunctions and features described herein may be performed on acommunications device, such as transcribing and/or recording telephoniccommunications, receiving commands and/or instructions affecting theconfiguration and control of an intelligent call transcript system, andtransmitting and/or receiving transcribed and/or recorded telephoniccommunication data. All such embodiments, and any embodiments combiningwired and/or mobile communications devices and network devices, arecontemplated as within the scope of the present disclosure.

Note also that any of the methods, systems, and means described hereinmay be used for any type of audible or audio data and with anytechnology. For example, rather than telephonic communications, themethods and means recited herein may be applied to any type ofconversation, such as those in a recorded medium or those recordedwithout the participation of the user requesting the recording. Notealso that the intelligent call transcript system as described herein maybe integrated into other telephony or communications related systems anddevices, such as a conference call system, a wireless or wirelinecommunications system, or any other type of communications system ordevice. All such embodiments are contemplated as within the scope of thepresent disclosure.

FIG. 4 illustrates an example wireless device 1010 that may be used inconnection with an embodiment. References will also be made to otherfigures of the present disclosure as appropriate. For example, mobilecommunications devices 111, 113, and 270 may each be a wireless deviceof the type described in regard to FIG. 4, and may have some, all, ornone of the components and modules described in regard to FIG. 4. Itwill be appreciated that the components and modules of wireless device1010 illustrated in FIG. 4 are illustrative, and that any number andtype of components and/or modules may be present in wireless device1010. In addition, the functions performed by any or all of thecomponents and modules illustrated in FIG. 4 may be performed by anynumber of physical components. Thus, it is possible that in someembodiments the functionality of more than one component and/or moduleillustrated in FIG. 4 may be performed by any number or types ofhardware and/or software.

Processor 1021 may be any type of circuitry that performs operations onbehalf of wireless device 1010. In one embodiment, processor 1021executes software (i.e., computer readable instructions stored in acomputer readable medium) that may include functionality related tointelligent transcription of telephonic communications, communicatingwith, operating, or interfacing with an intelligent call transcriptsystem, and/or running software configured to operate, communicate, orinterface with an intelligent call transcript system, for example. Userinterface module 1022 may be any type or combination of hardware and/orsoftware that enables a user to operate and interact with wirelessdevice 1010, and, in one embodiment, to interact with an intelligentcall transcript system. For example, user interface module 1022 mayinclude a display, physical and “soft” keys, voice and/or speechrecognition software, microphone, speaker and the like. Wirelesscommunication module 1023 may be any type or combination of hardwareand/or software that enables wireless device 1010 to communicate with,for example, network 110 or any other type of wireless communicationsnetwork. Memory 1024 enables wireless device 1010 to store information,recorded and/or transcribed telephonic communication and additionalrelated data, voice and/or speech recognition software, transcriptionsoftware, a web browser, or the like. Memory 1024 may take any form,such as internal random access memory (RAM), an SD card, a microSD cardand the like. Power supply 1025 may be a battery or other type of powerinput (e.g., a charging cable that is connected to an electrical outlet,etc.) that is capable of powering wireless device 1010.

Transcription module 1026 may be any type or combination of hardwareand/or software that enables wireless device 1010 to intelligentlytranscribe telephonic communications. In one embodiment, processor 1021may perform the functions of transcription module 1026. In analternative embodiment, transcription module 1026 may be separate fromprocessor 1023.

FIG. 5 is a block diagram of an example processor 1158 which may beemployed in any of the embodiments described herein, including as one ormore components of mobile communications devices 111,113, or 270, orwired communications device 112, or as one or more components ofcommunications network equipment or related equipment, such as anycomponent of network 110, and/or as one or more components of any thirdparty system or subsystem that may implement any portion of the subjectmatter described herein, such as network device 104. It is emphasizedthat the block diagram depicted in FIG. 5 is exemplary and not intendedto imply a specific implementation. Thus, the processor 1158 can beimplemented in a single processor or multiple processors. Multipleprocessors can be distributed or centrally located. Multiple processorscan communicate wirelessly, via hard wire, or a combination thereof.

The processor 1158 comprises a processing portion 1160, a memory portion1162, and an input/output portion 1164. The processing portion 560,memory portion 562, and input/output portion 1164 are coupled together(coupling not shown in FIG. 5) to allow communications between theseportions. The input/output portion 1164 is capable of providing and/orreceiving components, commands, and/or instructions, utilized to, forexample, operate an intelligent call transcript system or any subsystemor component thereof.

The processor 1158 can be implemented as a client processor and/or aserver processor. In a basic configuration, the processor 1158 mayinclude at least one processing portion 1160 and memory portion 1162.The memory portion 1162 can store any information utilized inconjunction with transmitting, receiving, and/or processing intelligenttranscription instructions or commands, intelligent transcriptionapplications, telephonic communications, transcripts of telephoniccommunication, recorded telephonic communications, etc. For example, asdescribed above, the memory portion is capable of storing transcripts,recorded telephonic communications, and/or software capable of operatingan intelligent call transcript system. Depending upon the exactconfiguration and type of processor, the memory portion 1162 can bevolatile (such as RAM) 1166, non-volatile (such as ROM, flash memory,etc.) 1168, or a combination thereof. The processor 1158 can haveadditional features/functionality. For example, the processor 1158 caninclude additional storage (removable storage 1170 and/or non-removablestorage 1172) including, but not limited to, magnetic or optical disks,tape, flash, smart cards or a combination thereof. Computer storagemedia, such as memory and storage elements 1162, 1170, 1172, 1166, and1168, include volatile and nonvolatile, removable and non-removablemedia implemented in any method or technology for storage of informationsuch as computer readable instructions, data structures, programmodules, or other data. Computer storage media include, but are notlimited to, RAM, ROM, EEPROM, flash memory or other memory technology,CD-ROM, digital versatile disks (DVD) or other optical storage, magneticcassettes, magnetic tape, magnetic disk storage or other magneticstorage devices, universal serial bus (USB) compatible memory, smartcards, or any other medium which can be used to store the desiredinformation and which can be accessed by the processor 1158. Any suchcomputer storage media may be part of the processor 1158.

The processor 1158 can also contain the communications connection(s)1180 that allow the processor 1158 to communicate with other devices,for example through network 110. Communications connection(s) 1180 is anexample of communication media. Communication media typically embodycomputer-readable instructions, data structures, program modules orother data in a modulated data signal such as a carrier wave or othertransport mechanism and includes any information delivery media. Theterm “modulated data signal” means a signal that has one or more of itscharacteristics set or changed in such a manner as to encode informationin the signal. By way of example, and not limitation, communicationmedia includes wired media such as a wired network or direct-wiredconnection as might be used with a land line telephone, and wirelessmedia such as acoustic, RF, infrared, cellular, and other wirelessmedia. The term computer-readable media as used herein includes bothstorage media and communication media. The processor 1158 also can haveinput device(s) 1176 such as keyboard, keypad, mouse, pen, voice inputdevice, touch input device, etc. Output device(s) 1174 such as adisplay, speakers, printer, etc. also can be included.

Network 110 may comprise any appropriate telephony radio network, or anyother type of communications network, wireline or wireless, or anycombination thereof. The following description sets forth some exemplarytelephony radio networks, such as the global system for mobilecommunications (GSM), and non-limiting operating environments. Thebelow-described operating environments should be considerednon-exhaustive, however, and thus the below-described networkarchitectures merely show how IP cellular broadcast may be used withstationary and non-stationary network structures and architectures. Itcan be appreciated, however, that intelligent call transcript systemssuch as those described herein can be incorporated with existing and/orfuture alternative architectures for communication networks as well.

The GSM is one of the most widely utilized wireless access systems intoday's fast growing communication environment. The GSM providescircuit-switched data services to subscribers, such as mobile telephoneor computer users. The General Packet Radio Service (GPRS), which is anextension to GSM technology, introduces packet switching to GSMnetworks. The GPRS uses a packet-based wireless communication technologyto transfer high and low speed data and signaling in an efficientmanner. The GPRS attempts to optimize the use of network and radioresources, thus enabling the cost effective and efficient use of GSMnetwork resources for packet mode applications.

As one of ordinary skill in the art can appreciate, the exemplaryGSM/GPRS environment and services described herein also can be extendedto 3G services, such as Universal Mobile Telephone System (UMTS),Frequency Division Duplexing (FDD) and Time Division Duplexing (TDD),High Speed Packet Data Access (HSPDA), cdma2000 1x Evolution DataOptimized (EVDO), Code Division Multiple Access-2000 (cdma2000 3x), TimeDivision Synchronous Code Division Multiple Access (TD-SCDMA), WidebandCode Division Multiple Access (WCDMA), Enhanced Data GSM Environment(EDGE), International Mobile Telecommunications-2000 (IMT-2000), DigitalEnhanced Cordless Telecommunications (DECT), 4G Services such as LongTerm Evolution (LTE), etc., as well as to other network services thatbecome available in time. In this regard, the systems and methods of anintelligent call transcript system can be applied independently of themethod of data transport, and do not depend on any particular networkarchitecture, or underlying protocols.

FIG. 6 depicts an overall block diagram of an exemplary packet-basedmobile cellular network environment, such as a GPRS network, in whichintelligent call transcript systems such as those described herein canbe practiced. In an example configuration, network 110 may beencompassed by the network environment depicted in FIG. 6. In such anenvironment, there may be a plurality of Base Station Subsystems (BSS)900 (only one is shown), each of which comprises a Base StationController (BSC) 902 serving a plurality of Base Transceiver Stations(BTS) such as BTSs 904, 906, and 908. BTSs 904, 906, 908, etc. are theaccess points where users of packet-based mobile devices (e.g., mobilecommunications device 111 or 113) become connected to the wirelessnetwork. In exemplary fashion, the packet traffic originating from userdevices (e.g., mobile communications devices 111 and 113) may betransported via an over-the-air interface to a BTS 908, and from the BTS908 to the BSC 902. Base station subsystems, such as BSS 900, may be apart of internal frame relay network 910 that can include Service GPRSSupport Nodes (SGSN) such as SGSN 912 and 914. Each SGSN may beconnected to an internal packet network 920 through which a SGSN 912,914, etc. may route data packets to and from a plurality of gateway GPRSsupport nodes (GGSN) 922, 924, 926, etc. As illustrated, SGSN 914 andGGSNs 922, 924, and 926 may be part of internal packet network 920.Gateway GPRS serving nodes 922, 924 and 926 may provide an interface toexternal Internet Protocol (IP) networks, such as Public Land MobileNetwork (PLMN) 950, corporate intranets 940, or Fixed-End System (FES)or the public Internet 930. As illustrated, subscriber corporate network940 may be connected to GGSN 924 via firewall 932; and PLMN 950 may beconnected to GGSN 924 via border gateway router 934. The RemoteAuthentication Dial-In User Service (RADIUS) server 942 may be used forcaller authentication when a user of a mobile cellular device callscorporate network 940.

Generally, there can be four different cell sizes in a GSM network,referred to as macro, micro, pico, and umbrella cells. The coverage areaof each cell is different in different environments. Macro cells may beregarded as cells in which the base station antenna is installed in amast or a building above average roof top level. Micro cells are cellswhose antenna height is under average roof top level. Micro-cells may betypically used in urban areas. Pico cells are small cells having adiameter of a few dozen meters. Pico cells may be used mainly indoors.On the other hand, umbrella cells may be used to cover shadowed regionsof smaller cells and fill in gaps in coverage between those cells.

FIG. 7 illustrates an architecture of a typical GPRS network segmentedinto four groups: users 1050, radio access network 1060, core network1070, and interconnect network 1080. Users 1050 may comprise a pluralityof end users (though only mobile subscriber 1055 is shown in FIG. 7). Inan example embodiment, the device depicted as mobile subscriber 1055,which may be a network within network 110 or that connects to network110, may comprise mobile communications devices 111, 113, and/or 270.Radio access network 1060 comprises a plurality of base stationsubsystems such as BSSs 1062, which include BTSs 1064 and BSCs 1066.Core network 1070, which may be a network within network 110 or thatconnects to network 110, comprises a host of various network elements.As illustrated here, core network 1070 may comprise Mobile SwitchingCenter (MSC) 1071, Service Control Point (SCP) 1072, gateway MSC 1073,SGSN 1076, Home Location Register (HLR) 1074, Authentication Center(AuC) 1075, Domain Name Server (DNS) 1077, and GGSN 1078. Interconnectnetwork 1080, which may be a network within network 110 or that connectsto network 110, may also comprise a host of various networks and othernetwork elements. As illustrated in FIG. 7, interconnect network 1080comprises Public Switched Telephone Network (PSTN) 1082, Fixed-EndSystem (FES) or Internet 1084, firewall 1088, and Corporate Network1089.

A mobile switching center may be connected to a large number of basestation controllers. At MSC 1071, for instance, depending on the type oftraffic, the traffic may be separated in that voice may be sent toPublic Switched Telephone Network (PSTN) 1082 through Gateway MSC (GMSC)1073, and/or data may be sent to SGSN 1076, which then sends the datatraffic to GGSN 1078 for further forwarding.

When MSC 1071 receives call traffic, for example, from BSC 1066, it maysend a query to a database hosted by SCP 1072. The SCP 1072 may processthe request and may issue a response to MSC 1071 so that it may continuecall processing as appropriate.

The HLR 1074 may be a centralized database for users to register to theGPRS network. HLR 1074 may store static information about thesubscribers such as the International Mobile Subscriber Identity (IMSI),subscribed services, and a key for authenticating the subscriber. HLR1074 may also store dynamic subscriber information such as the currentlocation of the mobile subscriber. HLR 1074 may also serve to interceptand determine the validity of destination numbers in messages sent froma device, such as mobile subscriber 1055, as described herein.Associated with HLR 1074 may be AuC 1075. AuC 1075 may be a databasethat contains the algorithms for authenticating subscribers and mayinclude the associated keys for encryption to safeguard the user inputfor authentication.

In the following, depending on context, the term “mobile subscriber”sometimes refers to the end user and sometimes to the actual portabledevice, such as mobile communications devices 111, 113, or 270 used byan end user of the mobile cellular service. When a mobile subscriberturns on his or her mobile device, the mobile device may go through anattach process by which the mobile device attaches to an SGSN of theGPRS network. In FIG. 7, when mobile subscriber 1055 initiates theattach process by turning on the network capabilities of the mobiledevice, an attach request may be sent by mobile subscriber 1055 to SGSN1076. The SGSN 1076 queries another SGSN, to which mobile subscriber1055 was attached before, for the identity of mobile subscriber 1055.Upon receiving the identity of mobile subscriber 1055 from the otherSGSN, SGSN 1076 may request more information from mobile subscriber1055. This information may be used to authenticate mobile subscriber1055 to SGSN 1076 by HLR 1074. Once verified, SGSN 1076 sends a locationupdate to HLR 1074 indicating the change of location to a new SGSN, inthis case SGSN 1076. HLR 1074 may notify the old SGSN, to which mobilesubscriber 1055 was attached before, to cancel the location process formobile subscriber 1055. HLR 1074 may then notify SGSN 1076 that thelocation update has been performed. At this time, SGSN 1076 sends anAttach Accept message to mobile subscriber 1055, which in turn sends anAttach Complete message to SGSN 1076.

After attaching itself with the network, mobile subscriber 1055 may thengo through the authentication process. In the authentication process,SGSN 1076 may send the authentication information to HLR 1074, which maysend information back to SGSN 1076 based on the user profile that waspart of the user's initial setup. The SGSN 1076 may then send a requestfor authentication and ciphering to mobile subscriber 1055. The mobilesubscriber 1055 may use an algorithm to send the user identification(ID) and password to SGSN 1076. The SGSN 1076 may use the same algorithmand compares the result. If a match occurs, SGSN 1076 authenticatesmobile subscriber 1055.

Next, the mobile subscriber 1055 may establish a user session with thedestination network, corporate network 1089, by going through a PacketData Protocol (PDP) activation process. Briefly, in the process, mobilesubscriber 1055 may request access to the Access Point Name (APN), forexample, UPS.com, and SGSN 1076 may receive the activation request frommobile subscriber 1055. SGSN 1076 may then initiate a Domain NameService (DNS) query to learn which GGSN node has access to the UPS.comAPN. The DNS query may be sent to the DNS server within the core network1070, such as DNS 1077, which may be provisioned to map to one or moreGGSN nodes in the core network 1070. Based on the APN, the mapped GGSN1078 can access the requested corporate network 1089. The SGSN 1076 maythen send to GGSN 1078 a Create Packet Data Protocol (PDP) ContextRequest message that contains necessary information. The GGSN 1078 maysend a Create PDP Context Response message to SGSN 1076, which may thensend an Activate PDP Context Accept message to mobile subscriber 1055.

Once activated, data packets of the call made by mobile subscriber 1055may then go through radio access network 1060, core network 1070, andinterconnect network 1080, in a particular fixed-end system, or Internet1084 and firewall 1088, to reach corporate network 1089.

Thus, network elements that can invoke the functionality of intelligentcall transcript systems and methods such as those described herein caninclude, but are not limited to, Gateway GPRS Support Node tables, FixedEnd System router tables, firewall systems, VPN tunnels, and any numberof other network elements as required by the particular digital network.

FIG. 8 illustrates another exemplary block diagram view of a GSM/GPRS/IPmultimedia network architecture 1100 in which intelligent calltranscript systems such as those described herein can be incorporated.As illustrated, architecture 1100 of FIG. 8 includes a GSM core network1101, a GPRS network 1130 and an IP multimedia network 1138, any or allof which may be networks within network 110 or networks that connect tonetwork 110. The GSM core network 1101 includes a Mobile Station (MS)1102, at least one Base Transceiver Station (BTS) 1104 and a BaseStation Controller (BSC) 1106. The MS 1102 is physical equipment orMobile Equipment (ME), such as a mobile telephone or a laptop computer(e.g., mobile communications device 111, 113, or 270) that is used bymobile subscribers, with a Subscriber identity Module (SIM). The SIMincludes an International Mobile Subscriber Identity (IMSI), which is aunique identifier of a subscriber. The BTS 1104 may be physicalequipment, such as a radio tower, that enables a radio interface tocommunicate with the MS. Each BTS may serve more than one MS. The BSC1106 may manage radio resources, including the BTS. The BSC may beconnected to several BTSs. The BSC and BTS components, in combination,are generally referred to as a base station (BSS) or radio accessnetwork (RAN) 1103.

The GSM core network 1101 may also include a Mobile Switching Center(MSC) 1108, a Gateway Mobile Switching Center (GMSC) 1110, a HomeLocation Register (HLR) 1112, Visitor Location Register (VLR) 1114, anAuthentication Center (AuC) 1118, and an Equipment Identity Register(EIR) 1116. The MSC 1108 may perform a switching function for thenetwork. The MSC may also perform other functions, such as registration,authentication, location updating, handovers, and call routing. The GMSC1110 may provide a gateway between the GSM network and other networks,such as an Integrated Services Digital Network (ISDN) or Public SwitchedTelephone Networks (PSTNs) 1120. Thus, the GMSC 1110 providesinterworking functionality with external networks.

The HLR 1112 is a database that may contain administrative informationregarding each subscriber registered in a corresponding GSM network.Such information may also include call replay preferences for eachsubscriber. The HLR 1112 may also contain the current location of eachMS. The VLR 1114 may be a database that contains selected administrativeinformation from the HLR 1112. The VLR may contain information necessaryfor call control and provision of subscribed services for each MScurrently located in a geographical area controlled by the VLR. The VLRmay also contain intelligent call transcription preferences for eachsubscriber. The HLR 1112 and the VLR 1114, together with the MSC 1108,may provide the call routing and roaming capabilities of GSM, as well asintelligent call transcription functionality. The AuC 1116 may providethe parameters needed for authentication and encryption functions. Suchparameters allow verification of a subscriber's identity. The EIR 1118may store security-sensitive information about the mobile equipment.

A Short Message Service Center (SMSC) 1109 allows one-to-one shortmessage service (SMS), or multimedia message service (MMS), messages tobe sent to/from the MS 1102. A Push Proxy Gateway (PPG) 1111 is used to“push” (i.e., send without a synchronous request) content to the MS1102. The PPG 1111 acts as a proxy between wired and wireless networksto facilitate pushing of data to the MS 1102. A Short Message Peer toPeer (SMPP) protocol router 1113 may be provided to convert SMS-basedSMPP messages to cell broadcast messages. SMPP is a protocol forexchanging SMS messages between SMS peer entities such as short messageservice centers. The SMPP protocol is often used to allow third parties,e.g., content suppliers such as news organizations, to submit bulkmessages.

To gain access to GSM services, such as voice, data, short messageservice (SMS), and multimedia message service (MMS), the MS may firstregister with the network to indicate its current location by performinga location update and IMSI attach procedure. The MS 1102 may send alocation update including its current location information to theMSC/VLR, via the BTS 1104 and the BSC 1106. The location information maythen be sent to the MS's HLR. The HLR may be updated with the locationinformation received from the MSC/VLR. The location update may also beperformed when the MS moves to a new location area. Typically, thelocation update may be periodically performed to update the database aslocation updating events occur.

The GPRS network 1130 may be logically implemented on the GSM corenetwork architecture by introducing two packet-switching network nodes,a serving GPRS support node (SGSN) 1132, a cell broadcast and a GatewayGPRS support node (GGSN) 1134. The SGSN 1132 may be at the samehierarchical level as the MSC 1108 in the GSM network. The SGSN maycontrol the connection between the GPRS network and the MS 1102. TheSGSN may also keep track of individual MS's locations and securityfunctions and access controls.

A Cell Broadcast Center (CBC) 1133 may communicate cell broadcastmessages that are typically delivered to multiple users in a specifiedarea. Cell Broadcast is one-to-many geographically focused service. Itenables messages to be communicated to multiple mobile telephonecustomers who are located within a given part of its network coveragearea at the time the message is broadcast.

The GGSN 1134 may provide a gateway between the GPRS network and apublic packet network (PDN) or other IP networks 1136. That is, the GGSNmay provide interworking functionality with external networks, and setup a logical link to the MS through the SGSN. When packet-switched dataleaves the GPRS network, it may be transferred to an external TCP-IPnetwork 1136, such as an X.25 network or the Internet. In order toaccess GPRS services, the MS first attaches itself to the GPRS networkby performing an attach procedure. The MS then activates a packet dataprotocol (PDP) context, thus activating a packet communication sessionbetween the MS, the SGSN, and the GGSN.

In a GSM/GPRS network, GPRS services and GSM services may be used inparallel. The MS may operate in one three classes: class A, class B, andclass C. A class A MS may attach to the network for both GPRS servicesand GSM services simultaneously. A class A MS may also supportsimultaneous operation of GPRS services and GSM services. For example,class A mobiles may receive GSM voice/data/SMS calls and GPRS data callsat the same time.

A class B MS may attach to the network for both GPRS services and GSMservices simultaneously. However, a class B MS does not supportsimultaneous operation of the GPRS services and GSM services. That is, aclass B MS can only use one of the two services at a given time.

A class C MS can attach for only one of the GPRS services and GSMservices at a time. Simultaneous attachment and operation of GPRSservices and GSM services is not possible with a class C MS.

A GPRS network 1130 may be designed to operate in three networkoperation modes (NOM1, NOM2 and NOM3). A network operation mode of aGPRS network may be indicated by a parameter in system informationmessages transmitted within a cell. The system information messages maydirect a MS where to listen for paging messages and how to signaltowards the network. The network operation mode represents thecapabilities of the GPRS network. In a NOM1 network, a MS can receivepages from a circuit switched domain (voice call) when engaged in a datacall. The MS can suspend the data call or take both simultaneously,depending on the ability of the MS. In a NOM2 network, a MS may notreceive pages from a circuit switched domain when engaged in a datacall, since the MS is receiving data and is not listening to a pagingchannel. In a NOM3 network, a MS can monitor pages for a circuitswitched network while receiving data and vice versa.

IP multimedia network 1138 was introduced with 3GPP Release 5, and mayinclude an IP multimedia subsystem (IMS) 1140 to provide rich multimediaservices to end users. IP multimedia network 1138 may be a component ofnetwork 110, or may connect to network 110. A representative set of thenetwork entities within the IMS 1140 are a call/session control function(CSCF), a media gateway control function (MGCF) 1146, a media gateway(MGW) 1148, and a master subscriber database, called a home subscriberserver (HSS) 1150. The HSS 1150 may be common to the GSM core network1101, the GPRS network 1130 as well as the IP multimedia network 1138.

The IP multimedia system 1140 may be built around the call/sessioncontrol function, of which there are three types: an interrogating CSCF(I-CSCF) 1143, a proxy CSCF (P-CSCF) 1142, and a serving CSCF (S-CSCF)1144. The P-CSCF 1142 is the MS's first point of contact with the IMS1140. The P-CSCF 1142 may forward session initiation protocol (SIP)messages received from the MS to an SIP server in a home network (andvice versa) of the MS. The P-CSCF 1142 may also modify an outgoingrequest according to a set of rules defined by the network operator (forexample, address analysis and potential modification).

The I-CSCF 1143 forms an entrance to a home network and hides the innertopology of the home network from other networks and providesflexibility for selecting an S-CSCF. The I-CSCF 1143 may contact asubscriber location function (SLF) 1145 to determine which HSS 1150 touse for the particular subscriber, if multiple HSSs 1150 are present.The S-CSCF 1144 may perform the session control services for the MS1102. This includes routing originating sessions to external networksand routing terminating sessions to visited networks. The S-CSCF 1144may also decide whether an application server (AS) 1152 is required toreceive information on an incoming SIP session request to ensureappropriate service handling. This decision is based on informationreceived from the HSS 1150 (or other sources, such as an applicationserver 1152). The AS 1152 may also communicate to a location server 1156(e.g., a Gateway Mobile Location Center (GMLC)) that provides a position(e.g., latitude/longitude coordinates) of the MS 1102.

The HSS 1150 may contain a subscriber profile, including intelligentcall transcription preferences, and keep track of which core networknode is currently handling the subscriber. It may also supportsubscriber authentication and authorization functions (AAA). In networkswith more than one HSS 1150, a subscriber location function providesinformation on the HSS 1150 that contains the profile of a givensubscriber.

The MGCF 1146 may provide interworking functionality between SIP sessioncontrol signaling from the IMS 1140 and ISUP/BICC call control signalingfrom the external GSTN networks (not shown.) It may also control themedia gateway (MGW) 1148 that provides user-plane interworkingfunctionality (e.g., converting between AMR- and PCM-coded voice.) TheMGW 1148 may also communicate with other IP multimedia networks 1154.

Push to Talk over Cellular (PoC) capable mobile telephones may registerwith the wireless network when the telephones are in a predefined area(e.g., job site, etc.) When the mobile telephones leave the area, theymay register with the network in their new location as being outside thepredefined area. This registration, however, does not indicate theactual physical location of the mobile telephones outside thepre-defined area.

While example embodiments of intelligent call transcript systems andmethods such as those described herein have been described in connectionwith various communications devices and computing devices/processors,the underlying concepts can be applied to any communications orcomputing device, processor, or system capable of implementingintelligent call transcript systems and methods described. The varioustechniques described herein can be implemented in connection withhardware or software or, where appropriate, with a combination of both.Thus, the methods and apparatuses for intelligent call transcriptsystems and methods, or certain aspects or portions thereof, can takethe form of program code (i.e., instructions) embodied in tangiblemedia, such as floppy diskettes, CD-ROMs, hard drives, or any othermachine-readable storage medium, wherein, when the program code isloaded into and executed by a machine, such as a computer, the machinebecomes an apparatus for an intelligent call transcript system. In thecase of program code execution on programmable computers, the computingdevice will generally include a processor, a storage medium readable bythe processor (including volatile and non-volatile memory and/or storageelements), at least one input device, and at least one output device.The program(s) can be implemented in assembly or machine language, ifdesired. The language can be a compiled or interpreted language, andcombined with hardware implementations.

The methods and systems for intelligent call transcription as describedherein can also be practiced via communications embodied in the form ofprogram code that is transmitted over some transmission medium, such asover electrical wiring or cabling, through fiber optics, or via anyother form of transmission, wherein, when the program code is receivedand loaded into and executed by a machine, such as an EPROM, a gatearray, a programmable logic device (PLD), a client computer, or thelike, the machine becomes an apparatus for an intelligent calltranscript system. When implemented on a general-purpose processor, theprogram code combines with the processor to provide a unique apparatusthat operates to invoke the functionality of an intelligent calltranscript system. Additionally, any storage techniques used inconnection with an intelligent call transcript system can invariably bea combination of hardware and software.

While the intelligent call transcript systems and methods have beendescribed in connection with the various embodiments of the variousfigures, it is to be understood that other similar embodiments can beused or modifications and additions can be made to the describedembodiments for performing the same functions of an intelligent calltranscript system without deviating from the described systems andmethods. For example, one skilled in the art will recognize that anintelligent call transcript system as described in the presentapplication may apply to any environment, whether wired or wireless, andmay be applied to any number of such devices connected via one or morecommunications networks and interacting across such networks. Therefore,intelligent call transcript systems such as those described hereinshould not be limited to any single embodiment, but rather should beconstrued in breadth and scope in accordance with the appended claims.

What is claimed is:
 1. A method comprising: receiving, on a networkdevice, a telephonic communication comprising speech; transcribing, onthe network device, the telephonic communication to generate atranscript; detecting, on the network device, an instruction generatedby a telephone within the telephonic communication; determining, on thenetwork device, that the telephone is associated with an authorizeduser; and responsive to detecting the instruction and determining thatthe telephone is associated with the authorized user, supplementing, onthe network device, the transcript with additional information.
 2. Themethod of claim 1, wherein the additional information comprises at leastone of a tag, highlighting, a hyperlink, text, or a marking.
 3. Themethod of claim 1, wherein supplementing the transcript with theadditional information comprises supplementing a portion of thetranscript associated with a predetermined amount of time with theadditional information.
 4. The method of claim 1, wherein the additionalinformation comprises a hyperlink that links to a portion of an audiorecording of the telephonic communication.
 5. The method of claim 1,further comprising, responsive to detecting the instruction,transmitting a communication.
 6. The method of claim 5, wherein thecommunication comprises a portion of the transcript.
 7. The method ofclaim 5, wherein the communication comprises a portion of the telephoniccommunication.
 8. A system comprising: a memory comprising instructions;and a processor coupled to the memory, wherein, when executing theinstructions, the processor effectuates operations comprising: receivinga telephonic communication comprising speech; transcribing thetelephonic communication to generate a transcript; detecting aninstruction generated by a telephone within the telephoniccommunication; determining that the telephone is associated with anauthorized user; and supplementing the transcript with additionalinformation responsive to detecting the instruction and determining thatthe telephone is associated with the authorized user.
 9. The system ofclaim 8, wherein the additional information comprises at least one of atag, highlighting, a hyperlink, text, or a marking.
 10. The system ofclaim 8, wherein the operation of supplementing the transcript with theadditional information comprises supplementing a portion of thetranscript associated with a predetermined amount of time with theadditional information.
 11. The system of claim 8, wherein theadditional information comprises a hyperlink that links to an audiorecording of the telephonic communication.
 12. The system of claim 8,wherein the operations further comprise, responsive to detecting theinstruction, transmitting a communication.
 13. The system of claim 12,wherein the communication comprises a portion of the transcript.
 14. Thesystem of claim 12, wherein the communication comprises a portion of thetelephonic communication.
 15. A computer-readable storage medium that isnot a transient signal, the computer-readable storage medium comprisingexecutable instructions, which when executed by a processor, cause theprocessor to effectuate operations comprising: receiving a telephoniccommunication comprising speech; transcribing the telephoniccommunication to generate a transcript; detecting an instructiongenerated by a telephone within the telephonic communication;determining that the telephone is associated with an authorized user;and responsive to detecting the instruction and determining that thetelephone is associated with the authorized user, supplementing thetranscript with additional information.
 16. The computer-readablestorage medium of claim 15, wherein the additional information comprisesat least one of a tag, highlighting, a hyperlink, text, or a marking.17. The computer-readable storage medium of claim 15, wherein theoperation of supplementing the transcript with the additionalinformation comprises supplementing a portion of the transcriptassociated with a predetermined amount of time with the additionalinformation.
 18. The computer-readable storage medium of claim 15,wherein the operations further comprise, responsive to detecting theinstruction, transmitting a communication.
 19. The computer-readablestorage medium of claim 18, wherein the communication comprises aportion of the transcript.
 20. The computer-readable storage medium ofclaim 18, wherein the communication comprises a portion of thetelephonic communication.